CDS

Accession Number TCMCG075C00843
gbkey CDS
Protein Id XP_007047556.2
Location complement(join(3611869..3612019,3612351..3612423,3612867..3613026,3613310..3613421,3614021..3614118,3614234..3614657,3615317..3615468,3615718..3615926,3616227..3616314,3616396..3616752))
Gene LOC18611300
GeneID 18611300
Organism Theobroma cacao

Protein

Length 607aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_007047494.2
Definition PREDICTED: probable amino-acid acetyltransferase NAGS1, chloroplastic [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category E
Description Amino-acid acetyltransferase
KEGG_TC -
KEGG_Module M00028        [VIEW IN KEGG]
KEGG_Reaction R00259        [VIEW IN KEGG]
KEGG_rclass RC00004        [VIEW IN KEGG]
RC00064        [VIEW IN KEGG]
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko00002        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
KEGG_ko ko:K14682        [VIEW IN KEGG]
EC 2.3.1.1        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
KEGG_Pathway ko00220        [VIEW IN KEGG]
ko01100        [VIEW IN KEGG]
ko01110        [VIEW IN KEGG]
ko01130        [VIEW IN KEGG]
ko01210        [VIEW IN KEGG]
ko01230        [VIEW IN KEGG]
map00220        [VIEW IN KEGG]
map01100        [VIEW IN KEGG]
map01110        [VIEW IN KEGG]
map01130        [VIEW IN KEGG]
map01210        [VIEW IN KEGG]
map01230        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGATGGCCGCTTCATATTCAACGGCTCGCGTCCCTCTCTTCTCTCCCGCTCGAACCAAACTCTTATCATCCCGCCACGGCTTCAAAAAGGGCGTCGTAAAACTAAAACCCGACTTGAAGTGCCGGGCTCAGTCTCTTAAACCGGAACCCGGGTCCAAGCGAGGTGATTCTGTCAAGCGCAATGTGATTAACGATGAAGATAGCGTGGAGGAGACTTACAACACCGTCGACGATAAGCAGTTCGTGCGGTGGTTCCGCGAGGCTTGGCCTTACCTCTGGGCCCATCGCGGCAGCACTTTCGTTGTTATTATTTCCGGCGAAATCGTCGCTTCCCCCTCTTTGGACGCCATTTTAAAGGATATTGCGTTTTTGCATCACCTAGGAATCAGATTTGTTATTGTTCCAGGAACTCACGTGCAGATCGACAAGCTTTTGGCCGAGAGAGACCATGAACCAAAGTATGTAGGCAGATATAGAATTACAGACTCAGAATCTCTAGCTGCAGCAATGGAAGCAGCAGGAGGGATTCGTCTAATGATAGAGGCAAAACTTTCTCCTGGACCTTCCATATGTAATATCCGTCGACATGGTGATAGTAGCCGTTGGCATGAAGTTGGTGTCAGTGTTGCTAGTGGAAACTTCCTTGCAGCTAAGAAAAGAGGAGTTGTTGAAGGTGTTGATTATGGAGCAACAGGTGAAGTAAAGAAGGTAGATGTTGCTCGCATGCGTGAGAGGCTTGACGGTGGTTGTATAGTAATATTAAGCAACCTGGGGTATTCTAGCTCTGGAGAAGTTTTGAATTGCAACACATATGAAGTTGCTACTGCTTGTGCATTAGCTATTGGAGCAGATAAGCTGATTTGCATTATAGATGGTCCAATTTTGGATGAGAATGGACGCCTTATTAATTTCTTGCCTCTTCAAGAAGCAGATATGTTAATCCGTCAACGGGCTAAGCAAAGCGAGACAGCAGCTAAATATGTGAAAGCTGTTGATGAAGAAGATGTCACTTGCCTTGGACATTATGATTCTATTGCAGTTGTCCCCTCTTCACAGAATGGGAAGGTTCTTAATAGTACACACAATCCAACCTTTCAGAATGGTGTTGGTTTTGATAATGGCAATGGACTATGGTCTGGAGAGCAGGGCTTTGCTATTGGAGGTCAGGAGCGGCTAAGTCGACTAAATGGCTACCTTTCAGAGTTGGCTGCTGCCGCTTTTGTCTGCAGAGGTGGTGTCCAAAGAGTTCATTTGTTAGATGGCACTATTGGTGGGGTCTTATTATTGGAACTGTTCAAAAGAGATGGAATGGGGACAATGGTGGCCAGTGATCTATATGAAGGTACCCGGATGGCGAAGGTGATGGATCTCTTAGGTATCAAGCAAATCATACAACCTTTAGAAGAGTCTGGCACATTGGTTTGCAGGAGTGATGAGGAGCTACGTAAGGCCATAGATTCATTTGTTGTTATGGAAAGGGAAGGTCAAATCGTTGCTTGTGCTGCTCTTTTTCCTTTTTTCAAGGACAAGTGTGGGGAAGTTGCTTGTATTGCAGTTTCTCCTGAATGCCGAGGACAAGGACAGGGAGACAAATTACTTGATTACGTAGAGAAGAAGGCATCATCCCTTGGATTGGATATGCTTTTCCTGCTGACAACCCGTACTGCTGATTGGTTTGTTAGGCGCGGCTTCGAAGAATGTACCATTGACATGATACCAGATGAAAGGAGGAAAAAGATCAATCTATCCCGTAAATCCAAGTATTACATGAAGAAGTTGCTACCGGATCGAAGTGGAATTACTGCTGATAGAGCATTTAAATGA
Protein:  
MMAASYSTARVPLFSPARTKLLSSRHGFKKGVVKLKPDLKCRAQSLKPEPGSKRGDSVKRNVINDEDSVEETYNTVDDKQFVRWFREAWPYLWAHRGSTFVVIISGEIVASPSLDAILKDIAFLHHLGIRFVIVPGTHVQIDKLLAERDHEPKYVGRYRITDSESLAAAMEAAGGIRLMIEAKLSPGPSICNIRRHGDSSRWHEVGVSVASGNFLAAKKRGVVEGVDYGATGEVKKVDVARMRERLDGGCIVILSNLGYSSSGEVLNCNTYEVATACALAIGADKLICIIDGPILDENGRLINFLPLQEADMLIRQRAKQSETAAKYVKAVDEEDVTCLGHYDSIAVVPSSQNGKVLNSTHNPTFQNGVGFDNGNGLWSGEQGFAIGGQERLSRLNGYLSELAAAAFVCRGGVQRVHLLDGTIGGVLLLELFKRDGMGTMVASDLYEGTRMAKVMDLLGIKQIIQPLEESGTLVCRSDEELRKAIDSFVVMEREGQIVACAALFPFFKDKCGEVACIAVSPECRGQGQGDKLLDYVEKKASSLGLDMLFLLTTRTADWFVRRGFEECTIDMIPDERRKKINLSRKSKYYMKKLLPDRSGITADRAFK